Author Response: We thank all our reviewers for their careful evaluation and numerous suggestions. We agree that evaluating only on in-domain sentences is a limitation. Recoverability estimates are nearly perfect for both the large and medium models. Our results and conclusions hold only for English, which is a limitation of the work. We will note this in the paper.